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Abstract  -  An  earlier  discussion  of  the  capacity  of  the  mean-square-constrained  Poisson 
channel  is  continued.  Using  a  theorem  of  Hoeffding,  it  is  shown  that  the  channel  informa¬ 
tion  capacity  is  the  same  with  or  without  an  on-off  keying  (OOK)  constraint  on  the  channel 
encoder  intensity,  affirmatively  resolving  the  conjecture  made  in  our  earlier  discussion. 
Thus  the  known  formula  for  the  information  capacity  of  the  OOK-constrained  channel 
applies  as  well  in  the  absence  of  an  OOK  constraint.  Adapting  arguments  used  by  Wyner  to 
address  the  peak-constrained  Poisson  channel,  it  is  also  shown  that  the  coding  capacity  with 
no  OOK  constraint  is  equal  to  the  corresponding  information  capacity.  This  establishes  that 
all  four  capacities  -  coding  and  information,  with  and  without  OOK-constrained  encoder  - 
are  equal. 

J  *  l  '  .  :•  . 

»  .  \ 


Supported  by  ARO  Contract  DAAL03-86-G-0022,  ONR  Contract  N00014-86-K-0039  and  NSF  Contract 
NCR87 13726. 


91  5  068 


DISTRIBUTION  STATEMENT  a 

Approved  for  public  release; 
Distribution  Unlimited 


UNCLASSIFIED _ 

SECURITY  CLASSIFICATION  of  this  PAGE 


1.  REPORT  SECURITY  CLASSIFICATION 

UNCLASSIFIED 


2m  SECURITY  CLASSIFICATION  AUTHORITY 


REPORT  DOCUMENTATION  PAGE 


IB.  RESTRICTIVE  MARKINGS 


2D.  OECLASSIFICATION/OOWNGRAOING  SCHEDULE 


A.  PERFORMING  ORGANIZATION  REPORT  NUMBERIS) 


3.  DISTRIBUTION/AVAILABILITY  of  REPORT 

Approved  for  Public  Release: 

Distribution  Unlimited 


5.  MONITORING  ORGANIZATION  REPORT  NUMBERIS) 


6«.  NAME  OF  PERFORMING  ORGANIZATION  16 b.  OFFICE  SYMBOL  7*.  NAME  OF  MONITORING  ORGANIZATION 

(if  applicable > 


Department  of  Statistics 


6c.  AOORESS  (City.  State  and  ZIP  Code) 

University  of  North  Carolina 
Chapel  Hill,  North  Carolina  27514 


Ba.  NAME  OF  FUNOING/SPONSORING  8b.  OFFICE  SYMBC 

ORGANIZATION  (If  applicable) 

Office  of  Naval  Research 


8c.  AOORESS  Idly,  State  and  ZIP  Code) 

Statistics  5  Probability  Program 
Arlington,  VA  22217 


11.  TITLE  (include  Security  Clarification) 

Capacities  of  the  Mean-Square-Constrained 


12.  PERSONAL  AUTHORIS) 

M.R.  Frey 


13*.  TYPE  of  REPORT  13b.  TIME  COVERED 

TECHNICAL  from _  to _ 


16.  supplementary  notation 


7b.  ADDRESS  (City,  State  and  ZIP  Code) 


9.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 

N00014-86 

-K-0039 

|  10.  SOURCE  OF  FUNDING  NOS. 

PROGRAM 

PROJECT 

TASK 

WORK  UNIT 

E  lement  no. 

NO. 

NO. 

_ 

NO. 

DATE  OF  REPORT  (Yr.,  Mo..  Day )  15.  PAGE  COUNT 

1990  Julv  l  ? 


17. 

COSAT1  COOES  | 

FIELD 

GROUP 

SU8.  GR.  1 

18,  SUBJECT  TERMS  (Continue  on  ntverte  if  neceuary  and  identify  by  block  number ) 

Poisson  channel.  Shannon  theory,  channel  capacity 


19.  ABSTRACT  (Continue  on  reverse  if  necerary  and  identify  by  block  number) 

An  earlier  discussion  of  the  capacity  of  the  mean-square-constrained  Poisson  channel 
is  continued.  Using  a  theorem  of  Hoef fding ,  it  is  shown  that  the  channel  information 
capacity  is  the  same  with  or  without  an  on-off  keying  (OOK)  constraint  on  the  channel 
encoder  intensity,  affirmatively  resolving  the  conjecture  made  in  our  earlier  discussion. 
Thus  the  known  formula  for  the  information  capacity  of  the  OOK-constrained  channel  applies 
as  well  in  the  absence  of  an  OOK  constraint.  Adapting  arguments  used  by  Wyner  to  address 
the  peak-constrained  Poisson  channel,  it  is  also  shown  that  the  coding  capacity  with  no 
OOK  constraint  is  equal  to  the  corresponding  information  capacity.  This  establishes  that 
four  capacities  -  coding  and  information,  with  and  without  OOK-constrained  encoder  - 
are  equal. 


TITLE  CONT. :  Poisson  Channel 


20  oistribution/availasility  of  abstract 

21.  abstract  security  class 

F  1  CATION 

unclassifieo/unumiteo  ^  same  as  rpt.x53  otic  users  □ 

22».  NAME  OF  RESPONSIBLE  INOIVIOUAL 

?7r*  TP  |  *J’  -  veer, 

(Include  Area  Code ) 

22c  OFF  ICE  SYMBOL 

C.R.  Baker 

(919)  962-2189 

DD  FORM  1473,  83  APR 


EDITION  OF  1  JAN  73  IS  OBSOLETE. 


UNCLASSIFIED _ _ 

SECURITY  CLASSIFICATION  of  this  PAGE 


Introduction 


We  continue  the  discussion  of  the  capacity  of  the  mean-square-constrained  Poisson  channel  with 
causal  feedback  and  general  channel  base  measure  from  [5],  The  Poisson  channel  is  a  continuous-time 
additive  noise  model.  In  the  model,  the  channel  output  I  is  a  stochastic  process  Y,=Xt+N,  where 
Af=(N,}osisr  is  the  channel  noise  and  X={X,}osist  is  the  transmitted  signal  into  which  is  encoded  the 
message  0={0,  }os<sr  (assumed  to  be  independent  of  the  noise  process  N .)  Here,  X  and  N  are  Poisson- 
type  point  processes  [8]  with  respective  compensators 

<  t 

A,=jxsb(ds),  B,  =jU>(ds)  =  \b([0,t)) 

0  0 

for  all  t  e  [0,7].  b  is  called  the  channel  base  measure  and  is  assumed  to  be  finite;  for  convenience  we 
assume  =  1  where  bj  =b{[0,T])lT .  Also,  the  noise  int'nsity  X>0  is  considered  as  a  real  constant.  A 
mean-square  constraint  E[x,2]<P2  is  imposed  on  the  encoder  intensity  Xi-  Xt  is  also  restricted  to  be 
predictable,  allowing  only  causal  feedback.  For  more  on  the  predictability  restriction  and  mean-square 
constraint  and  for  further  general  description  and  discussion  of  the  Poisson  channel  model,  the  reader  is 
invited  to  review  [5], 


A  special  case  of  both  practical  and  theoretical  interest  is  that  in  which  the  encoder  intensity  X:  is 
constrained  to  switch  between  only  two  values.  In  this  situation  an  "on-off"  keying  (OOK)  constraint  is 
said  to  be  imposed  on  the  encoder  intensity  in  recognition  of  the  fact  that  when  uie  range  of  x,  is  res¬ 
tricted  to  just  two  values,  then  one  of  these  values  should  be  zero  (to  minimize  the  effective  channel 
noise  intensity.)  In  an  actual  implementation  of  such  an  encoder,  transitions  of  Xt  between  its  zero 
value  and  its  second  (positive)  value  might  typically  be  accomplished  by  turning  on  and  off  a  power 
source.  Hence  the  nomenclature  "on-off  keying".  We  consider  the  Poisson  channel  with  and  without  an 
OOK  constraint  on  the  encoder  intensity. 

Information  capacity  is  defined  in  terms  of  the  average  mutual  information  /r[0,T]  in  the  mes¬ 
sage  and  channel  output  processes,  0  and  Y  over  the  interval  [0,T].  Let  p«,  Hr,  and  Hor  be  the  marginal 
and  joint  measures  induced  by  the  message  and  output  processes,  0  and  Y,  on  the  spaces  Se,  Sr,  and 
St>xSr  where  Se  and  Sr  are  the  spaces  of  trajectories  of  0  and  Y  over  the  interval  [0,7].  Write  the 
induced  product  measure  as  tw.  Then,  the  average  mutual  information  in  0  and  Y  over  the  interval 
[0,7]  is  [9] 


/7'[0,n  =  E 


d  (lexc 


provided  otherwise  /r[0,y]  =  °°.  Expressions  exist  for  the  average  mutual  information  over 

the  interval  [0,7]  in  the  Poisson  channel  with  base  measure  b  and  channel  output  intensity  q,.  Under 
the  conditions  of  the  channel  model 


lT[Q,Y]  =  E 


|(q,lnq,-q,lnqt)6  (dt) 


0) 


where  q,  =x<  +  X  is  the  channel  output  intensity  and  q,  is  the  predictable  version  of  the  mean  of  q, 
conditioned  on  the  path  of  Y  up  to  time  t .  The  channel  information  capacity  is 

C**,  -  sueP  S“P  y/r[0,H 


where  0  is  any  jointly  measurable  process  defined  over  the  interval  [0,7]  and  x  =  X(8-7)  is  any  predict¬ 
able  functional  satisfying  the  mean-square  constraint  E[X/2^E2].  We  write  <F§?o  and  CWo,  respectively, 
for  the  information  capacities  with  anti  without  CKJK-constrained  encoder  intensity.  To  emphasize  the 
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dependence  of  the  information  capacity  on  the  channel  parameters  \  and  P ,  we  use  the  notation 

C0im  =  D(k,P),  Cg&  =  Doa,P). 

The  functions  D(k,P)  and  D„(L,P)  are  easily  shown  [5]  to  be  first-order  homogeneous;  i.e., 
D(z\,zP)  =  zD(\,P)  and  D0 (z X,zP)  =  zD0  (k, P ).  Thus  to  generally  determine  the  information  capaci¬ 
ties  CtNK,  and  C£?o,  one  need  only  find  expressions  for 

D(0,P),  Do(0,P),  D0(1,P). 


In  our  initial  treatment  [5]  of  the  Poisson  channel  with  a  mean-square  constraint,  a  simple  for¬ 
mula  was  obtained  for  the  information  capacity  for  the  special  case  of  zero  noise  intensity.  There  it  was 
shown  that 

D(0,P)  =Do(0,P)  =  —P  . 

e 


The  case  of  nonzero  noise  intensity  was  found  to  be  less  tractable  and  an  explicit  expression  for  the 
information  capacity  was  found  only  for  the  special  case  in  which  the  encoder  intensity  is  OOK- 
constrained: 


D0{\,P) 


sup  P2 .  afc(l/a)  +  I 
a2*  (l  ICDZP1  a  p2k(a/P2)/a+ 1 


(2) 


where 


k(x)=- 

e 


1.1 

x 


*+i 


-  X 


The  remainder  of  the  paper  is  comprised  of  three  sections.  The  first  uses  a  theorem  of  Hoeffding 
[7]  to  show  that  D(l,P)=D<J(l.f>).  allowing  (2)  to  be  used  to  calculate  the  capacity  Ctsm=D(X,P). 
The  second  section  introduces  coding  capacity  and  shows  it  to  be  equal  to  the  information  capacity. 
Some  final  remarks  are  made  in  the  last  section. 
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Information  Capacity 


The  purpose  of  this  section  is  to  show  that  the  information  capacities  C^-po  and  C§?o  are  equal. 
This  is  known  to  be  true  in  the  special  case  X=0  of  zero  noise  intensity.  Thus,  showing  Cinfo=C£?o 
amounts  to  showing 

D(l,P)  =  D0(l,P).  (3) 

(3)  is  proved  in  part  by  an  application  of  a  theorem  of  Hoeffding  [7]  on  the  extrema  of  expectations. 
We  prepare  the  way  for  the  use  of  this  theorem  with  some  notation  and  several  lemmas. 

Let  B  (P)  denote  the  class  of  nonnegative  random  variables  X  satisfying  E  [X2]  =P2.  Also  let 
B*(P)c:B(P)  denote  those  members  X  of  B(P)  which  are  discrete.  We  use  Bk(P)  to  denote  the 
class  of  X  e  B*(P)  with  no  more  than  k  atoms.  We  also  freely  use  B(P),  B*(P),  and  !Bk(P)  to 
denote,  respectively,  the  classes  of  distribution  functions  generated  by  X  eB(P),  X  e  IB ’(P ),  and 
X  e  B k(P ).  The  distance  d(F ,G )  between  two  distribution  functions  F  ,G  e  IB (P)  is  defined  to  be 

d(F,G)  =  Zf0\F(x)-G(x)\. 

We  also  define  the  functional  <j>(F  )  for  F  e  B  (P )  by 

<KF)  =  J(x  +  l)InCx  +  l)F(«fcc). 

0 

<|>(F)  exists  and  is  finite  for  all  F  e  B  (P ).  Indeed,  <{ >(F)<P2+P  for  all  F  e  B  (P). 

Lemma  1:  For  every  e>0  and  F  e  B(P)  there  exists  a  distribution  function  F*e  B*(P)  such  that 

d(F,F‘)  <  e.  (4) 


Proof.  (4)  holds  for  F*U)=|_ I"— lF(x  +  A)J  /  ["— ]  for  some  choice  of  A>0. 

e  e 

Lemma  2:  For  every  e>0  and  F  e  B(P),  there  exists  a  5>0  such  that 

l<j>(F)  -  <|>(G)I  <  e 


for  all  G  e  B  (P )  satisfying  d(F  ,G)<  8. 


Proof.  Let  F  e  B(P).  Let  G  e  B{P)  be  any  distribution  function  such  that  d(F ,G)< 5.  Associate 
with  F  and  G  the  truncated  distribution  functions 


Fb(x)  = 


Gb  (*)  = 


h(x),  x<B 
1  1,  x>B  ’ 

\G(x),  x<8 

1  1,  xZB  ■ 


The  idea  for  the  proof  is  to  show  that  4»(F)  and  <KFB )  differ  by  an  arbitrarily  small  amount  for  a  large 
enough  choice  of  B .  Likewise  for  <t>(G )  and  <p(Gs ).  Next,  FB  and  GB  are  approximated  by  discrete  dis¬ 
tribution  functions  F"  and  G"  with  n  equal  size  jumps.  We  show  that  the  differences  I <>(F s )  —  <J)(F " )  I 
and  l<j>(Ga)-<j>(G")l  can  be  made  arbitrarily  small  by  improving  (letting  n  — >°°)  the  approximations  of 
Fa ,  Gb  by  F",  G".  Finally,  to  complete  the  proof,  it  is  shown  that,  for  5->0,  i<p(F ” )— 4>(G " ) I  is 
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arbitrarily  small  uniformly  in  G  for  ail  G  within  distance  8  of  F. 

We  have 

l<K^)-<KG>!  <  WF'I-Wb) I  +  l<KG ) - <KGs ) I  +  l<t>(Fa)-<t>(F")l 

+  |<j>(Gfl)-<KG")l  +  l<KG*)-<!>(F*)l 

where,  for  positive  integers  n,  F“  and  Gn  are  discrete  approximations  of  FB  and  GB  :  Fn  and  G"  are, 
respectively,  the  right-continuous  versions  of  the  functions 

-r*F(-)l,  -fnG()l. 
f\  n 

We  have 

|<t>(F)-<KF8)l  <  j(x  +  l)ln(jc  +  l)F(dx)  -  (B  +  l)ln(B  + 1)(1  -F(5 )) .  (5) 


The  integral  on  the  RHS  of  (5)  is  finite  so  B  can  be  chosen  so  that 

so 

f(x  +  l)ln(x  + 1  )F{dx)  <  et 


(6) 


for  any  e, >0.  Now 


0  <  (fi  +  l)ln(S  +  1)(1  -  F  (B ))  <  J  (x  +  ljln(x  +  1  )F  (dx ) , 


so 

|<t>(F)-<KFfl)l  <e, 

for  B  large  enough.  We  choose  B  so  that  (6)  and  a  similar  statement  for  G  hold.  Then 
|<D(F)-<KG)I  <  !<K/rg)-<j>Gr',)l  +  !4>(<Gb )  —  <b(G " ) I 
+  l4i(G")-<t)(F“)l  +  2£,. 

Define  F,  to  be  the  distribution  function  F"(x)=|nF(x)J/n.  Let  (x,  ,  i  =0,1, ...,/* }  be  the  locations  of 

the  jumps  of  Fn  and  F";  the  jumps  of  F*  are  located  at  {x,  ,  i  =0, 1 . n-1)  and  the  jumps  of  F„  are 

located  at  {x; ,  i  =  1,2 . n }.  Then 

l<KF8)-(J)(F")|  <  (j)(F,)-<j)(F") 

=  —  £(*;  +  1)ln(jCi  +  !)  ~  —  +  + 1) 

ni= i  n  i=0 

=  —  (x„  +  l)ln(x„  +  1)  -  — (x0+  l)ln(x0+  1) 
n  n 

<  -(B  +  l)ln(B  +  1). 
n 

Consider  n  >T(B  +  l)ln(B  +l)/e21.  For  any  such  n  and  given  any  e2>0,  we  have  \^(FB)-^(F’')'\  <e2. 
Given  our  choice  of  B ,  the  same  n  gives  I4>(G8)-0(G")I  <e2.  Therefore 

I4>(F )  —  <}»(G )  I  <  1<KG',)-4>(^'*)I  +  2c,  +  2e2 

for  sufficiently  large  n  and  B .  For  d  (F  ,G )  <  5, 

!<KG")-d>(F»)t  <  <b(Fns)-<t>(F") 
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where  Ff(x)=Fn(t x  -8).  Thus 

[0(G',)-0(F")|  <  -£(*  +8+  l)lnfc  +8+  1)  -  -"£(*,  +  Dlnfc  +  D 

n  i=\  n  i=0 

Choose  8  so  that 

8  <  iSiS.fe-JEi-i) 

and  define  xn+! = xn  +  8.  Then  xi+1  -  5  >  x,  for  all  i  =0, 1, ....  n .  Thus 

I0(FB)-0(G,,)I  ^  —  £(xi+i  +  l)ln(xi+1  +  l)-  —  X(x, +l)In(x, +1) 

n  i=l  n  i= o 

=  —  fa  +  l)ln(jc,  +  1)  +  (xB+1  +  l)ln(xB+1  +  1) 
fl  L 

-  (x0+l)ln(x0+l)  -  (xi  +  IMxj  +  I) 

<  — (fl  +8+l)ln(fl  +8+1). 
n 

Let  n  =f(S  +2)ln(fl  +  2)/e3] .  For  sufficiently  small  8e(0,l)  and  all  G  e  B(P)  satisfying  d(F,G)<8,  fl 
and  n  can  be  chosen  large  enough  to  give 

|0(F)-0(G)l  S2£,  +4e3 

for  any  £  =  2^  +4e3>0.  This  is  what  we  sought  to  prove. 

We  are  now  in  a  position  to  use  Hoeffding’s  theorem. 

Lemma  3:  Define  /  [X  ]  =  E  \X  lnX  ]  -  £  (X  ]ln£  [X  ].  Then 

X  eX)/[*  + 1]  =  X  + 11  •  (?) 

Proof.  Writing  p.=£[X]  and  F  for  the  distribution  function  of  X,  we  have 

X  imp)1  [X  +  «  =  ifp  *  !2P(P)  0(F)  -  01+  l)ln(n+  1)  , 

1  ^  [n=£[X] 

'  ' 

X.jjWt*  + 1]  =  ifp  *  <S>)  0(F)  -  0i+  l)lnOx+ 1)  . 

Lemmas  1  and  2  show  that  0  satisfies  the  conditions  required  in  Hoeffding’s  theorem  [7,  Theorem  2.1]. 
The  proof  is  complete  since  for  constrained  first  and  second  moments,  Hoeffding’s  theorem  states  that 

sup  sup 

X  eB(P) 0(F )  =  XeZB3(£)0(F). 

(i  =  £[X]  U=£[X] 

Lemma  3  is  enough  to  prove  that  three-level  encoder  intensities  are  sufficient  to  realize  channel 
information  with  rate  approaching  Coro-  Two-level  encoding  -  the  on-off  keying  case  -  can  be  shown  to 
be  sufficient  by  inspection  of  I[X  + 1]  for  X  e  ffl3(F).  We  express  X  e  B^P)  by 


0,  w.p.  \~P\~P2 
X  =  -  a, ,  w.p.  px 
a2,  w.p.p2 
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for  nonnegative  numbers  pi,p2,al,a2  such  that  px+p2<\  and  pxax  +  p-fi2  <P2.  The  phrase  "with 
probability"  is  abbreviated  by  "w.p."  here.  Consider  the  4-tuple  (p1,p,2,ai,a2)  as  a  point  **e  ^4  belong¬ 
ing  to  the  set 

A  =  {TeEA:  pu  p2,ax,a2>0,  px  +p2<  1,  p,a,2  +p2a2  <P2} . 


f  ( zf)=I[X  + 1]  defined  on  A  is  continuous.  To  be  closed,  the  set  A  lacks  only  the  limit  points  associ¬ 
ated  with  sequences  of  points  (z*  =(pu,P2* with  third,  fourth,  or  both  coordinates 
a  u, a 2k  as  i  Therefore  the  supremum  of  / (?)  over  ?e  A  (equivalently,  the  supremum  of 
I[X  + 1]  over  X  £  IB  2(P ))  is  to  be  found  as  the  value  of  /(?)  for  some  particular  value  (or  values)  of 
?e  A  or  as  the  limit  of  a  sequence  {/(?*)}  where  {z* )  is  a  sequence  of  points  z*  e  A  with  one  or  both 
coordinates  au,a2*-»°°-  We  consider  three  cases  exhausting  the  possibilities  for  the  supremum  of 
f(f):  the  case  in  which  at  least  one  coordinate  of  ?  is  zero,  the  case  in  which  the  supremum  is  the 
limit  with  ax  — »«>  or  a2—>°°,  and  the  case  in  which  no  coordinate  of  ?  is  zero. 

Let  ?0 e  A  have  a  zero  coordinate  and  suppose  the  supremum  of  /  (?)  over  A  is  f  (?0).  In  this 
case,  X  actually  has  only  two  atoms  and  two-level  encoding  suffices  to  obtain  channel  information  rates 
approaching  capacity. 


Suppose  the  supremum  is  the  limit  of  a  sequence  [f  (z*)}  where  the  coordinate  alk  of  Tk  tends  to 
infinity  and  a  2*  is  bounded.  Then 


lim  /  (z* )  =  lim 

k—**> 


S  lim 


<  lim 


P\k(a  1*  +  l)ln(au  +  1)  +  Pufcu  +  lMflat  +  1) 

~  (Plkalk  +P2ka2k  +  +P2ka7k  +  1) 

( P2laxk)(axk  +  l)In(<*u  + 1)  +  Pn(aik  +  l)ln(a2*  +  1) 
~  (Plia2k  +  l)ln(P2*a2t  +  1) 

P2*(^2*  +  l)ln(a2*  +  1)  -  (P2tfl2*  +  l)ln(P2*fl2*  +  1) 


This  shows  the  supremum  to  be  dominated  by  the  limit  of  a  sequence  of  values  of  7[Xt  +  l]  where 
Xk  e  IB2(P).  The  symmetric  case  in  which  -»<*>  while  axk  is  bounded  and  the  case  where  both 
coordinates  tend  to  infinity  are  handled  similarly. 

Finally,  consider  the  case  in  which  the  supremum  of  / (?)  is  given  by  / (t0)  for  some 
?,eA  nZE+  where  ZE+=  {?e  Ek:  px,p2,ax,a2>0).  Then  the  supremum  is  actually  the  maximum  and 
we  have  the  nonlinear  programming  problem:  Maximize  the  objective  function  / (?)  over  ?e  E*  sub¬ 
ject  to  the  constraints  gt(?)<0  and  g2(f)<  0  where 

8  l(?)  =  *1*32  +  *2*4  - 

8 2(2*)  =  *i  +  z2  -  I- 


In  the  terminology  of  nonlinear  programming,  E  *  is  the  feasible  set  and  any  feasible  point  satisfying 
constraints  gx(T)<0,  g2(f)<0  is  said  to  be  attainable.  In  the  present  case,  the  set  of  attainable  points  is 
exactly  the  set  A  nE+.  Also  in  the  present  case,  the  feasible  set  is  open  and  /,  gx,  g2  meet  required 
differentiability  conditions  [1],  so  any  given  attainable  point  t„  which  maximizes  /  must  satisfy  the 
Kuhn-Tucker  conditions  [1J: 

Vf(K)  -  uxVgx(T0)  -  u2Vg2(T0)  =  0,  (8) 

Uigi(to)  =  0,  1  =  1,2  (9) 

u,  >  0 ,  (  =  1,2 


where  the  Lagrange  multipliers  u ,,  u2  are  unique.  It  follows  from  (9)  that  u2  =  0  or  g2(?o)  =  0.  The  latter 
case  g 2(z^)=0  reflects  the  situation  in  which  Pi+p2=I;  a  situation  in  which  X  has  only  two  3toms.  If 
8 2(K)*0  then  u2~ 0-  Then  in  (8)  we  have 
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(ax  +  l)ln(a!+  1)  -  ax  -  axln(pxax+p2a2  +  1)  -  uxa2  =  0 ,  (10) 

(a2+  l)ln(a2+  1)  -  a2  -  a2\n('plal+p2a2+  1)  -  uxa2  =0,  (11) 

/>iln(a!  +  1)  -  p1ln(pla:+p2a2  +  1)  -  2 uiplal  =0,  (12) 

p2ln(a 2+  1)  -  p2ln(p,a1+p2a2  +  1)  -  lulp2a2  =  0.  (13) 


Multiplying  (12)  by  ax!p\  and  subtracting  the  result  from  (10)  gives  ui  =  t(ax)  where 

,,  ,  x  -ln(x  +  1)  n 

^(x)= - \ - ,  x>0. 

x 

A  similar  calculation  using  (11)  and  (13)  gives  ux  =  Z1(a2).  The  inverse  £-1  exists  since 

£'(*)  =  "TT^Y  (Jt  +  l)ln(x  +  l)-  yx2-x  >0 

for  x  >0.  «]  is  unique  for  a  given  T0  so  ax=a2 ,  implying  that  X  e  B2(P).  Thus,  in  this  last  case  as 
well,  two-level  encoding  suffices.  This  proves 

Lemma  4  : 

x!m\p)nx+w-xil\P)nx^\]. 


Theorem  1:  Suppose  the  encoder  intensity  Xt  °f  a  Poisson  channel  with  noise  intensity  X  is 
predictable  and  mean-square-constrained  £[%,2]</>2.  For  this  channel 

f  (npo  =  CS?o  =  D0  (X,P ) 

so  that 

D{Q,P)=  2P 

e 


and,  for  X>0, 


D(k,P)  =  —In  y/fl)  +  X 

asA  a  P2k(ka/P2)/a+X 


with  A  =  [a0 ,«)  and  a0  satisfying  a2k  (Xa, )  =  P2. 


Proof.  The  case  X=0  is  known  [5]  to  be  true.  Also  in  [5],  it  was  shown  that  C§?o  =  D(X,P)  as 

given  in  (14)  for  X>0.  Obviously,  C(NFo>fS?o  so  all  that  remains  to  be  shown  is  that 

Cinfo  ^  L)0  (X,P)  (15) 

for  X>0.  D  and  D0  are  homogeneous  [5]  so,  actually,  it  is  enough  to  show  (15)  for  the  case  X=l. 
From  (1)  and  Jensen’s  inequality 

It[Q,Y]  < /(x,  +  l]6 (10,71)  =  /[x,  +  1]7\  (16) 

Here  in  (16)  x<  is  just  any  random  variable  belonging  to  IB  (P).  Therefore, 

£Wo  ^  x  iW(P)^x  +  1]  =  x  +  1]  (17) 

where  the  equality  in  (17)  follows  from  Lemmas  3  and  4.  It  follows  from  [5,  Lemma  4]  that  the  RHS 

of  (17)  is  D0(\,P).  Hence  the  proof  is  complete. 


f 


Coding  Capacity 


Coding  capacity  is  the  threshold  on  transmission  rates  below  which  essentially  error-free  com¬ 
munication  is  possible.  Coding  capacity  is  founded  on  ideas  of  channel  codes,  decoding  schemes,  anu 
decoding  error  probability.  A  code  ( M,T,Pt )  for  the  mean-square -constrained  Poisson  channel  is  a  set 
of  M  equally  likely  nonnegative  waveforms  x«(0.  fe  [0,7],  m  =  1.....A/  corresponding  to  unique  mes¬ 
sages  and  satisfying  the  constraint 


Let  SY  be  the  space  of  trajectories  of  Y  on  [0,7],  A  decoding  scheme  is  a  mapping  DSY  — » ( 1,2 . M  } 

where  the  integers  l,2,...,,Vf  are  labels  for  members  of  the  set  of  possible  messages.  The  error  probabil¬ 
ity  associated  with  D  is 

P,  =  ^ZP{D(YTo)*m\Xm  O) 

where  y^e  SY  denotes  the  path  (F,:  te  [0,7]).  A  code  (M  .7 ,P,)  has  rate  R  =(  l/7)laVf .  A  code  rate 
R  is  said  to  be  achievable,  if  for  all  e>0,  there  exists  a  code  ( M  ,T  ,P ,)  whose  parameters  satisfy 
M  >eRT  with  Pt  <i  for  7  sufficiently  large.  The  coding  capacity  CCOdino  is  the  supremum  of  achiev¬ 
able  rates.  Cmoino  is  the  coding  capacity  of  the  OOK-constrained  Poisson  channel  in  which  it  is  addi¬ 
tionally  required  that  the  encoded  message  waveforms  XmO).  m  =  1 . Vf  of  any  code  i.Vf ,r,R,)  take 

on  no  more  than  tv’C  values. 

Coding  capacity  can  be  related  to  information  capacity  using  Fano’s  inequality  [4], 

Lemma  5\  Ccodcsg—  limsup  Clvfo. 

T 

Proof.  Tiio  proof  uses  Fano’s  inequality  [2),  [4j  in  a  straightforward  adaption  of  proofs  [2).  [6 i 
given  in  the  context  of  other  channel  models. 


Applying  Lemma  5  to  the  Poisson  channel  with  and  without  the  OOK  constraint,  we  have 

Cx  ^fOOK  ^  <r*OOK 

CODING  -  ^INFOt  U'COOCNo  -  ^  INFO  • 

We  showed  in  Theorem  1  that  Com  =  €,<^  and,  of  course,  ^coding  ^  ^coding  so,  to  show  that  the  four 
capaciues 

iT  P  OOK  f  OOK 

'*■'  INFO  >  ^  INFO  .  L  CODING  »  CODING 

are  equal,  we  need  only  show  that  CcS^ngSCSkS.  This  is  accomplished  using  an  argument  employed 
by  Wyncr  [10]  to  address  the  peak-constrained  Poisson  channel. 

Wyner’s  approach  [10]  is  to  consider  the  Poisson  channel  as  a  binary  discrete  mcmorylcss  chan¬ 
nel  [6].  The  message  is  assumed  to  be  a  stream  of  Is  and  Os  produced  at  the  rate  of  one  symbol  0, 
each  A  seconds.  The  encoded  message  waveform  XmO )  is  assumed  to  be  constant  in  the  intervals 
((n-l)Api A],n  =  1,2,...  taking  on  only  the  values  x »  =  0  or  X*  ~a<  0  if  the  message  symbol  is  0,  =0 
and  a  if  the  message  symbol  is  0,  =  1.  The  receiver  decoder  identifies  the  message  symbol  as  1  if 
Yn  s  ~  y((1-i)A=  1  The  message  symbol  is  identified  as  0  otherwise.  The  discrete  channel  created  by 
using  the  Poisson  channel  in  this  way  lias  channel  yansilion  probabilities 

Poi  =  7(7,  =  110,  =0}  = 


« 
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Pi,  = /’{!'.  =  110.  =  1}  =  <k  +  a)e-^+a*. 

The  coding  capacity  of  the  OOK-constrained  Poisson  channel  is  lower  bounded  by  the  coding  capacity 
of  the  binary  channel  with  these  transition  probabilities.  Coding  capacity  and  information  capacity  are 
equal  16]  for  the  discrete  memoryless  channel  so 

ACc^s^P/te.n  as) 

where  G  =  P 2/4 2  and  q  =P  {0=1].  In  (18)  and  in  the  definition  of  7,  the  subscript  n  on  9  and  Y  is  no 
longer  relevant  and  has  been  dropped.  The  restriction  q  <a  in  (18)  accounts  for  the  mean-square  con¬ 
straint  £[x2]  =  qA2<P2. 

Define  h  (x )  to  be  the  binary  entropy  function 

h(x)  =  -x\nx  -  (1 -x)ln(l -x) 

and  let 

f(q,A)  =  A (poj(l  ~q)  +  pnq)  -  qh(pVl)  -  (1  -q)h(pm). 

Then 

;usP/[9,n  =  f/0/[X.n 

=  ^fa{H[Y)-H[Y\X]\ 

=  *“0  [MPoiO -q )^Pn<?)  -  qhiPu)  -  (1  -q)h(poi)\ 

=  f/0/(7,4).  (19) 

Let  s  =  XlA .  For  small  A,  we  can  w-rite 

qpw  +  (l -q  )po\  =  sA  Ae""*A(l -q)  +  q(s  +  1)4  Ae'*^ +  1) 

=  r4A(l  —  <7 )[  1  -54A  +  <?(A)]  +  q{s  +  1)4  A[1  -A  A(s  +  \  )~o  (A) 

=  sA  A(  1  -  q )  +  qsA  A  +  qA  A  +  o  (A) 

=  .4  A(r  +  q )  +  o  (A) . 

We  have  h  (x )  =  -x  lux  +  x  +  0  (x :)  as  x  -» 0  so 

f(q,A)  =  -{p<n(\-<l)+Puq)\n(pQ](\-q)  +  pnq) 

+  PoiO -<7)+Pii7  +  0(pm(\ -q)J-pn‘l) 

+  qp ninpii  -  ii  +  0(p n ) 

+  0-7)PoilnPot  -  (1  -<?)/> oi  +  0(p 0:i ) 

=  -(Poi(l-7)  +  Pn<7  )ln(poi(l-7)-'-Pii<7) 

+  <7Pulnpn  +  (1  -<?)poilnpoi  +  o(A) 

=  ( 1  -  q  )s4  Aln(r4  A)  *  q  (s  +  1  )4  Aln((r  +  1  )A  A) 

-(5+^)4  Aln((r  +  q  )4  A)  +  o  ( A) 

=  4  A^(l  -7)slnr  +  (1  -<7)sln(4  A) 

+  q  (5  +  1  )In(5  +  1 )  +  q  (s  +  1  )ln(4  A) 

-  (5+7  )ln(s  +  7 )  -  (s  +  7  )ln(s  +  7  )ln(A  A)  +  0  (A) 

=  4  A  (1  -7  )r  Ins  +  7(5  +  l)ln(r  +  1)  -  (s  +q)ln(s  +7  )j  +  o  (A) .  (20) 
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t 


When  combined,  (18),  (19)  and  (20)  give 

CS85no  a  *fa  a[(1-?)51iu  +  q(s  +  l)In(j  +  1)  -  (s  +  q  )In(j  +  q )]  . 

We  have 

(l-<?)shu  +  q(s  +  l)ln(j  +  1)  =  )XlnX  +  ^  (X  +  A)ln(X+A)j  — 7(X  +  <?A)lnA 

A  v.  J  A 


and 


Therefore 


(. s  +q)\n(s  +q)  =  \-Ck  +  qA  )ln(X  +  <?A )  — ^-(X+<?A)lnA  . 

A  A 

CSum  2  *fa  [(1-4  )XlnX  +  q  (k  +  A  )ln(X  +  A )  -  (X  +  qA  )ln(X  +  qA )] 
=  x  €fflP(.D )  [e  [(X  + X)In  (X+X)]-(E[X}+ \)ln  (E  [X  ]  +  X) 

=  X  €sP2(P)I[X  +^1 


This  proves 

Theorem  2:  For  the  Poisson  channel  with  mean-square-constrained  encoder  intensity  EIX/2]—-^ 
and  noise  intensity  X, 

f[NFO  -  4-INFO  =  C CODING  =  ^CODING  =  D(X,P)  . 

The  capacities  in  Theorem  2  are  all  capacities  of  the  Poisson  channel  with  causal  feedback.  How¬ 
ever,  as  observed  in  [5],  Cinfo  and  CS?o  are  also  no-feedback  capacities.  Therefore  Ccoding  and  CcodLvg 
are  also  no- feedback  capacities. 


I 


I 

Other  Remarks 


We  are  justified  to  an  extent  by  Theorem  2  in  speaking  of  the  capacity  of  the  mean-square- 
constrained  Poisson  channel  without  using  qualifying  descriptors;  for  nonrandom  noise  intensity  X  all 
the  capacities  -  information  and  coding,  with  and  without  OOK  constraint,  with  and  without  causal 
feedback  -  are  the  same.  The  availability  of  the  expression  (’4)  for  the  channel  capacity  enables  us  to 
consider  questions  of  time- varying  channel  parameters  X(t)  and  P  (f  )■  Strictly  analogous  to  proof  for  the 
peak-constrained  Poisson  channel  in  [3],  it  can  be  shown  that  for  b  -measurable  X(t)  and  P  (/) 

,  T 

£Wo  =  jD(Ut\P(t))b(dt).  (21) 

1  o 

It  is  easily  shown  that  (21)  is  the  information  capacity  with  or  without  causal  feedback.  Further  restric¬ 
tions  on  X(t)  and  P(t)  to  provide  for  stationarity  such  as  periodicity  or  almost  periodicity  allow  expres¬ 
sions  analogous  to  (21)  to  be  given  for  Ccoding-  See  [4]. 

Given  (21)  for  the  information  capacity,  it  is  possible  to  give  a  treatment  of  jamming  along  the 
lines  given  in  [3]  for  the  peak-constrained  Poisson  channel.  We  take  as  our  jamming  model 

T),  =Xt  +X(t)+J, 

where  T],  is  the  intensity  of  the  channel  output  and  J,  is  the  intensity  -  possibly  stochastic  -  of  the 
Poisson-type  process  G,  jamming  the  channel.  Also,  we  make  the  assumptions  that  G,  is  independent  of 
the  message  and  noise  processes  0,  and  N,  and  that 

l  7 

-~:\E[Jt}b(dt)<Pj. 

1  o 

Under  these  assumptions  it  can  be  shown  that  from  the  standpoint  of  minimizing  information  capacity, 
the  optimal  jamming  intensity  is  nonrandom  and  follows  the  form  of  a  vuterfilling  schemum  [2];  i.e., 

jr  =  {aP(t)-X(t)}+ 


where  a>0  satisfies 

,  t 

*=rf[ctP  (t  )-\(t  )]+b(dt )  =  Pj  . 

1  o 

It  is  hoped  that  knowledge  of  the  form  of  D{X,P)  given  here  will  also  shed  light  on  other  ques¬ 
tions  relating  to  channel  error  exponent,  random  noise  intensity,  and  marked  Poisson  channels. 
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